System and method for the measurement of the relative position of an object with respect to a point of reference

ABSTRACT

System for the measurement of the relative position of an object with respect to a point of reference, comprising one or more image acquisition subsystems and a processing unit of said acquired images apt to analyze such images according to recognition and localization techniques.

The present invention refers to a system and to a method for the measurement of the relative position of an object with respect to a predetermined point of reference.

An exemplary application of the system and of the method according to the present invention is that of the measurement of the relative position of a ball with respect to a specific line of a field of play.

In several sports, a referee acknowledges an event depending on the relative position of a ball with respect to a specific line of the field of play. E.g., in Soccer a referee awards a score to a team only when the ball has crossed over the goal line.

However, the referee can autonomously note the scoring of a goal solely when the latter is apparent, e.g. due to the swelling out of the netting or to the ball remaining inside of the goal. However, quite frequently the ball crosses over the goal plane and, without touching the net, immediately exits therefrom due to an odd rebound onto the soccer field, a contact with the goal posts, or a player's clearance.

When the ball speed is not overly high (it could even reach the 120 Km/h) the referee, in order to decide on the event at issue, can be aided by assistants, who however should have ideal observing conditions, i.e. be positioned on the goal line having the goal in sight. Otherwise the referee, having to decide one way or the other, runs the risk of awarding a non-existent (phantom) scored goal, or of not awarding an actually scored goal.

To solve this problem, several known systems, all referable to the same category, provide the use of sensors, inserted inside of the goal structure, receiving a signal from transmitters applied into the playing ball when the latter crosses the goal mouth.

However, such systems entail the remarkable disadvantage of being invasive, requiring electronic devices to be inserted in the playing structures (goals and ball). Hence, in order to use the former a general modification to the fields of play and the use of specific balls would be required. These devices are not always applicable, as the modification required to the playing structures could interfere with the laws of the game.

Moreover, a known visual-type system relies instead on the observation of the field of play by suitably positioned cameras. This system determines the position of the vertical projection of the center of mass of the ball onto the plane of the field of play, exploiting the information (known a priori) on the dimensions of the various areas thereof.

However, this system entails the disadvantage of exclusively detecting the ball crossing over a determined line, yet providing no indication about the height above ground of the ball during its crossing, an information crucial in order to confidently claim that the ball has crossed over the goal plane.

A first system for determining the position of an object in a space is known by WO 00/72264. Such system provides for the identification of the object into the images by means of the so-called chroma-key system that operates by comparing areas of the images with a treshold.

Another system is disclosed in EP 0662600. In such case a system is provided for determining the absolute spatial position of an object.

Nevertheless, such system has several drawbacks. In particular the system is only capable of determining the position of an object that moves on a single plan, in a predetermined direction.

Furthermore, the recognition of the object into the acquired images is simply performed by comparison with fixed tresholds, making the result dependent from the environmental conditions.

Purpose of the present invention is to solve the abovecited problems of the known art providing a method for the measurement of the relative position of an object with respect to a point of reference, comprising the following steps:

-   -   acquiring a plurality of images, each of said images being apt         to display said point of reference;     -   processing each image of said plurality of acquired images; and     -   computing said relative position of said object with respect to         said point of reference,     -   characterised in that said step of processing each image fulter         comprises a step of recognizing said object inside each of said         images, said step of recognizing said object being performed by         a classifier.

The present invention further provides a system for the measurement of the relative position of an object with respect to a point of reference comprising:

-   -   one or more image acquisition subsystems, each subsystem being         apt to acquire a plurality of images, each of said images         displaying said point of reference; and     -   a unit for the processing of said acquired images,     -   characterised in that said processing unit comprises a         classifier for recognising said object inside each of said         image.

Hence, a field of play comprising such system could advantageously be provided.

For simplicity's sake, hereinafter reference will still be made to the application of the system in the case of Soccer. Of course, it is umderstood that the described system and method could be useful in any other application entailing the same techiwcal problem.

The main advantage of the method and of the system according to the present invention lies in that those entail no modification to any component of the field of play, or to the ball.

A second advantage lies in the robustness and in the reliability of the detection of the scored goal event. Said method integrates a geometrical measurement of the position of the ball in the three-dimensional space (obtained by a binocular system) to a qualitative assessment of the observed event (obtained with a monocular system) thereby emulating the activity of a human observer enjoying the best observing-conditions. The integration of the measurements provided by said systems ensures a high precision rate and a minimum error probability in any situation, including those of partial ball obstruction, e.g. by one or more players.

A third advantage lies in the completeness of the information on the position of the object with respect to a point of reference, as the method provides the three-dimensional coordinates thereof.

A fourth advantage lies in that the system according to the present invention uses, for image acquisition, high-speed digital cameras, whose performances surpass those of the common cameras as well as those of the human eye.

A further advantage lies in that the system according to the present invention provides an objective digital recording of the event itself, obtained by an advantageous positioning of the cameras and the concomitant acquisition of the observed scene. This recording enable to subsequently review the scene at will, to validate the signaled event (goal/non-goal), as it typically happens in the case of a digital viewer having high time resolution and simultaneous multivision.

Further advantages, features and operation modes of the present invention will be made apparent by the following detailed description of preferred embodiments thereof, given by way of a non-limiting example, making reference to the figures of the attached drawings, wherein:

FIG. 1 is a block diagram of a first embodiment of the system according to the present invention;

FIG. 2 is a block diagram of a monocular subsystem according to the present invention;

FIG. 3 shows an exemplary positioning of the camera, and the related view of the observed scene, for a monocular subsystem according to the present invention;

FIGS. 4 a and 4 b show some images of examples processed during the training step of a Support Vector Machine for classification used in a first embodiment of the monocular subsystem according to the present invention;

FIGS. 5 a and 5 b show some images of examples processed during the training step of a neural network used in a second embodiment of the monocular subsystem according to the present invention;

FIG. 6 is a block diagram of a binocular subsystem according to the present invention;

FIG. 7 sketches the operation principle of the binocular subsystem according to the present invention;

FIG. 8 shows an exemplary positioning of the cameras and the views of the observed scene thereof, for a binocular subsystem according to the present invention;

FIG. 9 shows an exemplary positioning of the cameras and the views of the observed scene thereof, for the system of FIG. 1;

FIG. 10 is the block diagram of a second embodiment of the system according to the present invention;

FIG. 11 shows an exemplary positioning of the cameras with the views of the observed scene thereof, for the system of FIG. 10;

FIG. 12 is the block diagram of a third embodiment of the system according to the present invention;

FIG. 13 shows an exemplary positioning of the cameras with the views of the observed scene thereof, for the system of FIG. 12;

FIG. 14 is the block diagram of a fourth embodiment of the system according to the present invention; and

FIG. 15 shows an exemplary positioning of the cameras with the views of the observed scene thereof, for the system of FIG. 14.

With initial reference to FIG. 1, a first embodiment of the system according to the present invention is shown.

According to this embodiment, the system 1 comprises one or more subsystems 2, 3 apt to concomitantly acquire images of the portion of the field of play at issue comprising the selected point of reference, in particular of the goal area.

The subsystem 2 is of a monocular type, apt to process images acquired from a single position, whereas the subsystem 3 is of a binocular type, apt to process pairs of acquired images from two distinct positions. The operation of the two types of subsystems will be detailed hereinafter.

The monocular subsystem 2 and the binocular subsystem 3 are independent therebetween, each one being apt to provide information data related to the position of the ball when the latter enters the respective visual field.

These data are forwarded to a processing unit 4 comparing them therebetween and computing an end result 5.

Next, FIG. 2 is a block diagram depicting a monocular-type subsystem 2.

The subsystem 2 comprises an image transducer 11, e.g. a camera. This camera 11 should be selected from those enabling to acquire the greatest possible number of images per second, so as to minimize the ball translation As between two successive images. In fact, the camera can detect with certainty the goal-scoring event solely when the ball has crossed the goal line of a distance at least equal to Δs/2.

By way of an example, using a ‘Dalsa Motions Vision progressive Scan Area’—type camera and in particular an CA-D6 0512 model, enabling the acquisition of 262 images/sec with a 536×516 pixel resolution, for a hypothetical ball speed of 120 Kmnth, the subsystem 2 can detect with certainty the goal scoring event when the ball has crossed the goal line of 6.5 cm.

The format of the data outputted by the camera is digital, and it meets the standard EIA-644, i.e. it is apt to be interfaced with other electronic apparatuses for the storing and the processing of the acquired images. In particular, high-speed storing means 12, based e.g. on Ultra 2 SCSI technology with 10000 rpm disks (Seagate) are provided. Thus, all the acquired images are stored in a database 13.

The acquired images are processed by specific computing means 14. Such computing means 14 comprises dedicated systems, like e.g.:

-   -   DATACUBE MAXPCI®, based on an architecture consisting of several         components (convolutor, ALU, Warper, etc.), enabling to perform         an 8×8 convolution step on a 512×512×8 bit image in 7 ms;     -   MATROX GENESYS®, based on a TI C80 DSP multiprocessor         architecture, providing up to three processors for each card, up         to a maximum of six cards, with 100 Bops performances; or     -   ALACRON FANTISMAGE®, based on a PCI multiprocessor (up to eight)         TRIMEDIA 1×100 architecture (Philips Semiconductor) enabling 500         Mflop performances.

Alike architectures are required due to the considerable amount of required computing and for the real-time obtainment of the result.

FIG. 3 shows a preferred positioning for a camera A with respect to the field of play 100 during the use of a monocular subsystem 2 and the image Va thereof acquired by the camera A.

According to this first embodiment of the system 1, the monocular subsystem 2 comprises a camera A positioned with the optical axis lying on the goal line plane in the direction of the goal, apt to autonomously detect the goal scoring event represented by the ball crossing over the goal plane.

Of course, other positionings could be provided in order to attain improved system performances.

The use of a monocular subsystem 2 as the one hereto described enables to implement a qualitative-type method for the detection of the goal scoring event according to the visual information obtained by a camera with no measuring, merely referring the position of the ball observed on each image to several fixed points of reference. In this context, the latter are provided by the goal structure. Hence, hardware and software processing means implement a first qualitative-type decision-making system. The latter, upon being inputted an image, outputs in real-time a signal indicating whether the image be qn instance of the considered event.

According to a first embodiment of the monocular subsystem 2, the processing of the acquired images comprises two essential steps:

-   -   Recognizing the object ball inside of the image; and     -   Locating the ball with respect to predetermined points of         reference.

The recognizing of the object ball can be performed by object recognition technologies based on contour or area analysis.

These methodologies are well known to those skilled in the art, hence a further detailing thereof will be omitted. In particular, according to this first embodiment of the monocular subsystem, a recognition technique based on example learning called SVM (Support Vector Machine) was used.

Next, FIGS. 4 a and 4 b show some sets of exemplary images used to this end.

The ball-recognizing step is performed so as to solve various problems, which may be encountered in a likewise application.

In particular, the difficulties related to the environmental conditions (intensity and direction of the lighting of the taken area, presence of shadows), to the indistinctness of the ball onto the background of the taken scene and to the measurement of ‘false positives’ and/or of ‘false negatives’ are taken into account and overcome.

This problems are tackled and solved by a preprocessing of each acquired image, substantially performed by carrying out software-type procedures aimed at:

-   -   reducing the search area of the object in each image; and     -   extracting identifying information of the object, useful to the         detection and to the localization thereof inside of the image.

In order to solve the abovecited problems, a preprocessing module, acting on an analysis of the gray tones, on the correlation among consecutive images and on a windowing around the position of the preceding image, is provided.

The method according to the present invention provides a training step with examples aimed at implementing a classifier, e.g. of the two-class (ball/non-ball) type, apt to catalogue each image in one of the two classes with the least possible error.

This training step comprises the following steps:

-   -   acquiring a plurality of exemplary images, positive and negative         examples of the object ball to be extracted therefrom;     -   preprocessing of each extracted example, consisting in an         equalizing of the histogram, quantizing the gray level thereof         and masking in order to eliminate the edge effects; and     -   training the classifier by an SVM-type technique. Hence, the         method for the detection of the considered event provides:     -   the continuous acquisition of images of the goal area;     -   for each acquired image, the extraction of all viable subimages         having dimensions equal to that of the examples used during the         SVM ball recognition training step;     -   each subimage is preprocessed as described above and it is         classified as ball or non-ball by the SVM classifier;     -   when a subimage is an instance of the ball, its position in the         image with respect to that of the goalpost (known a priori) is         used to decide whether the goal-scoring event has occurred.

According to a second embodiment of the monocular subsystem 2, the processing of the acquired images provides a training step aimed at implementing a classifier, in particular of the two-class type, apt to catalogue with the least possible error each image in one of the two goal/non-goal classes.

With respect to the previous embodiment, the classifier directly provides the end result, i.e. a decision on the occurrence of the considered event (goal/non-goal).

This training step comprises the following steps:

-   -   acquiring a plurality of images, positive and negative examples         of the goal scoring concept to be extracted therefrom, as         illustrated in the subsequent FIGS. 5 a and 5 b;     -   preprocessing of each extracted example, consisting in an         equalizing of the histogram and a subsequent quantizing of the         gray levels thereof; and     -   training the classifier, by a neural network-based technique.

In this case, the method for the detection of a goal scoring provides:

-   -   the continuous acquisition of images of the goal area;     -   for each acquired image, the extraction of all the viable         subimages having dimensions and typology alike to those of the         examples used during the training step;     -   each subimage is preprocessed as hereto described and it is         classified as goal/non-goal by the neural network.

The neural network used consists of three levels: a first input level, a second intermediate level and a third output level.

The number of nodes of the levels is determined according to the size of the problem at issue. The first level has a number of nodes, defined by the dimension of the input subimages, whereas the output level has a single node, as the problem requires a binary-type (goalnon-goal) response.

The operation principles of a neural network are well known to those skilled in the art, hence will not be detailed hereinafter.

FIGS. 5 a and 5 b show some subimage sets, respectively used as positive and negative examples, of goal scoring during the training step of the decision-making system. These examples substantially consist of real images depicting a side view of the goal with a ball taken in all the possible positions and under different visibility conditions.

During the training step, the neural network classifier learns to recognize the occurrence of the goal-scoring event according to the information contained in the exemplary images provided.

Each taken image is processed by the decision-making system, which returns a response on the actual detection of a goal scoring.

The image sets employed during the classifier training step should cover most of the actual situations which might occur, thereby ensuring a correct operation of the system and reliable responses even under less-than-ideal conditions, e.g. of visibility.

Of course, the classification techniques (SVM and neural network) adopted in the abovedescribed two different embodiments of the monocular subsystem 2 can indifferently be used in the two cases, without thereby changing the overall operation principle of the system. I.e., for the ball recognition also a neural network-based classifier or an SVM-type classifier for the identification of the goal-scoring event could be used.

However, the abovedescribed classification techniques are not the only ones useful in tackling the problem at issue. Actually, any two-class classifier based on an example-learning technique could be used in both embodiments of the monocular subsystem 2 according to the present invention.

FIG. 6 is a block diagram of the binocular subsystem 3.

A pair of image transducers 11, e.g. high-speed cameras of the abovedescribed type, acquires images of the area of the field of play at issue.

These images are stored by storing means 12′ on a file 13′ and processed by computing means 14′.

These computing means are alike those hereto described in connection with the monocular systems, hence will not be detailed anew hereinafter.

With reference to FIG. 7, the binocular subsystem 3 is based onto the measuring of the three-dimensional position of the center of mass B of the ball taken as point of intersection of the lines of sight {R_(i)} generated by two cameras 11 taking the area from different points of sight: B=R ₁ ×R ₂

For each point of sight considered a line of sight R_(i) is generated by the optical center of the camera, intersecting the center of mass B of the ball in the three-dimensional space and the center b_(i) of the projection of the ball on the image plane I_(i).

For each camera, the line of sight intersecting the ball center is determined by the two intersections thereof with the goal plane Π and the field of play plane Γ. These intersections are estimated from the position of the ball on the image plane and from information known a priori on the spatial position of the goalposts and of the lines delimiting the goal area.

In particular, each line of sight is computed as follows:

-   -   estimating a first projective transformation H_(Γi) between a         first known three-dimensional plane, e.g. the plane of the field         Γ, and the projection thereof on the image plane I_(i)         (homography H_(Γi)), from the correspondences of the four         intersection points {P}_(j)=1, . . . ,4 of the four orthogonal         field lines {L_(j)}j=1, . . . ,4 delimiting the goal area, whose         positions in the Euclidean space are known a priori, and the         respective projections {p_(j)}j=1, . . . ,4 in the image I_(i);     -   estimating a second projective transformation H_(Πi) between a         second known three-dimensional plane, e.g. the goal plane Π, and         the projection thereof on the image plane I_(i) (homography         H_(Πi)), from the correspondences of the four points of         intersection {Q_(j)}j=1, . . . ,4 of the two vertical lines,         onto which the outside edges of the two goalposts lie, to the         goal line and to the horizontal line onto which the outside edge         of the crossbar lies, and the respective projections {p_(j)}j=1,         . . . ,4 in the image I_(i);     -   estimating the projection B_(Γi) of the center of mass of the         ball B onto the field of play plane Γ projecting the image b_(i)         thereof with the homography H_(Γi):         B _(Γi) =H _(Γi) b _(i)     -   estimating the projection B_(Πi) of the center of mass of the         ball B onto the goal plane Π projecting thereon the image bi         thereof with the homography H_(Πi):         B _(Πi) =H _(Πi) b _(i);     -   estimating the line of sight R_(i) from the two points of         intersection thereof with the field of play plane B_(Γi) and the         plane onto which the goal B_(Πi) lies:         R _(i) =B _(Γi) ×B _(Πi).

The binocular subsystem requires the following information input:

-   -   two images (I1, I2) referring to two different sights obtained         by two uncalibrated cameras;     -   the coordinates, on any image plane, of the ball center b_(i),         estimated adopting the technique based on example learning with         the SVM algorithm used in the monocular system as well.         Alternatively, any one known pattern recognition technique could         be adopted;     -   the intersections {p_(ij)} of the field of play lines delimiting         the goal area in any image plane;     -   the intersections {q_(ij)} of the lines delimiting the goal in         any image plane;     -   the intersections {P_(j)} of the field of play lines delimiting         the goal area onto the Euclidean plane Γ of the field of play;         and     -   the intersections {Q_(j)} of the lines delimiting the goal         structure in the three-dimensional Euclidean space.

FIG. 8 is a top plan view of the field of play showing an advantageous positioning of the two cameras B, C of the binocular subsystem 3 near the goal, apt to minimize the error in the estimate of the distance of the ball from the goal plane. The cameras acquire images alike those indicated with Vb and Vc in the figure.

FIG. 9 shows the system 1 according to the disclosed first embodiment, providing the combined use of a monocular subsystem 2 (camera A) and of a binocular subsystem 3 (cameras B and C) and the respective views of the acquired images Va, Vb and Vc.

In this case as well, other positions could be provided in order to better adjust the system operation to the specific application.

The binocular subsystem 3 can autonomously detect the goal scoring event, intended as the crossing over of the goal line by the ball, via a second metric-type decision-making system using the measurement of the three-dimensional positioning, yet, generally speaking, it cannot provide a visually assessable confirmation thereof.

The monocular subsystem 2 can autonomously determine only the position of the ball with respect to the goal plane, providing an objective confirmation thereof in the related recording. In fact, without additional sights, a ball having crossed over the goal line yet lying outside of the goal can appear to lie therein.

According to the first embodiment, the system 1 integrates the two subsystems, overcoming the limitations that each one thereof would entail when individually used.

As the monocular subsystem 2 enables to determine solely the position of the ball with respect to the goal plane, the addition of the sights of the binocular subsystem 3 allows to recognize also those crossings of the goal line taking place outside of the goal.

The addition of the monocular subsystem to the binocular one, besides enhancing the reliability of the automated detection of the event, further enables an advantageous visual confirmation thereof.

The binocular 3 and the monocular 2 subsystems independently assess the crossing of the goal line. The redundancy of the assessments provided by the two subsystems increases the reliability of the automated detection of the goal-scoring event. The integration of the results provided by the two hereto described subsystems enables to obtain a single final assessment of the event with the utmost rate of certainty available.

Making reference to FIG. 10, a second embodiment of the system according to the present invention is shown. According to this embodiment, the system 1 comprises two monocular subsystems 2, 2′ of the hereto-described type.

Next, FIG. 11 is a plan view of the field of play 100 onto which the cameras A and D of the monocular subsystems 2 and 2′ are positioned. The cameras A and D acquire images alike those shown in views Va and Vd, respectively.

In particular, the camera D is positioned with the optical axis perpendicular to the goal plane and passing through the center thereof.

The processing of the acquired images by the camera D consists in singling out the ball in the image, inside of the goal structure.

This step is implemented using an SVM-type classifier, referring the position of the ball to the goalposts and crossbar.

Then, this information is integrated to that provided by the monocular subsystem 2, i.e. to the information on the position of the ball with respect to the goal plane enabling the qualitative detection the crossing over of the goal line by the ball.

FIG. 12 shows a third embodiment of the system 1 according to the present invention. According to such third embodiment the system 1 comprises three monocular subsystems 2, 2′ and 2″.

In this case the system 1 provides redundant information, however the redundancy enhances the reliability of the measurement of the scored goal event.

Next, FIG. 13 shows a plan view of the field of play 100 and a preferred positioning of the three cameras A, E and D related to the three monocular subsystems. The views Va, Ve and Vd are examples of images acquired by the three cameras.

The use of redundant information is advantageous in order to solve cases of ball obstruction with respect to one of the two symmetrical cameras A and E.

FIGS. 14 and 15 show a fourth embodiment of the system 1 comprising two binocular subsystems 3, 3′ of the abovedisclosed type.

In this case, four cameras B, C, F and G are positioned onto the field of play 100, so as to provide respective images alike those shown in views Vb, Vc, Vf and Vg. The combined use of the four cameras and the processing of the images provided enhances the reliability of the detection of the scored goal event and reduces the instances of non-detection, caused e.g. by ball obstruction.

The present invention has hereto been described according to preferred embodiments thereof given as non-limiting examples. It is understood that other embodiments may be provided, all however to be construed as falling within the protective scope thereof, as defined by the appended claims. 

1. A method for the measurement of the relative position of an object with respect to a point of reference, comprising the following steps: acquiring a plurality of images, each of said images being apt to display said point of reference; processing each image of said plurality of acquired images; and computing said relative position of said object with respect to said point of reference, characterised in that said step of processing each image further comprises a step of recognizing said object inside each of said images, said step of recognizing said object being performed by a classifier.
 2. The method according to claim 1, wherein said object recognizing step comprises a step of preprocessing each of said images.
 3. The method according to claim 2, wherein said preprocessing step comprises the steps of: analyzing the gray levels of each of said images; performing a windowing of each of said images apt to reduce the search area of said object; extracting identifying information of said object.
 4. The method according to claim 3, wherein said step of extracting said identifying information is performed by techniques based on contour or area analysis.
 5. The method according to any of the claims 1 to 4, wherein said classifier is based on SVM-type techniques.
 6. The method according to any of the claims 1 to 4, wherein said classifier is neural network-based.
 7. The method according to any of the claims 1 to 6, wherein said step of computing said relative position of said object with respect to said point of reference is performed by a first qualitative-type decision-making system.
 8. The method according to any one of the claims 1 to 7, further comprising a step of training with examples, apt to the construction of said classifier.
 9. The method according to claim 8, wherein said step of training with examples comprises the following steps: acquiring a plurality of exemplary images; equalizing the histogram of each of said exemplary images; quantizing the gray levels of each of said exemplary images; and providing each of said exemplary images to said classifier.
 10. The method according to claim 9, wherein each image of said plurality of exemplary images is an instance of said object.
 11. The method according to claim 9, wherein each image of said plurality of exemplary images is an instance of said object with respect to said point of reference.
 12. The method according to any one of the preceding claims, further comprising a step of measuring the three-dimensional position of the center of mass of said object with respect to said point of reference.
 13. The method according to claim 12, wherein said step of measuring comprises a step of computing the point of intersection of the lines of sight generated by two points of observation.
 14. The method according to claim 13, wherein for each of said points of observation, the computation of said line of sight comprises the following steps: estimating a first projective transformation (H_(Γi)) between a first known three-dimensional plane (Γ), and the projection thereof on the image plane (I_(i)); estimating a second projective transformation (H_(Πi)) between a second known three-dimensional plane (Π) and the projection thereof on the image plane (I_(i)); estimating a first projection (B_(Γi)) of the center of mass (B) of said object from said image plane (I_(i)) to said known three-dimensional plane (Γ) by the first projective transformation (H_(Γi)); estimating a second projection (B_(Πi)) of the center of mass (B) of said object from said image plane (I_(i)) to said three-dimensional plane (Π) by the second projective transformation (H_(Πi)); estimating said line of sight, from respective points of intersection with the plane (Γ) and the plane (Π).
 15. The method according to any one of the preceding claims, wherein said step of computing said relative position of said object with respect to said point of reference is performed by a second metric-type decision-making system.
 16. The method according to any one of the preceding claims, further comprising a step of storing each of said acquired images.
 17. A system for the measurement of the relative position of an object with respect to a point of reference comprising: one or more image acquisition subsystems (2, 2′, 2″, 3, 3′), each subsystem being apt to acquire a plurality of images, each of said images displaying said point of reference; and a unit (4) for the processing of said acquired images, characterised in that said processing unit (4) comprises a classifier for recognising said object inside each of said image.
 18. The system according to claim 17, wherein at least one of said one or more image acquisition subsystems is a monocular-type subsystem (2, 2′, 2″).
 19. The system according to claim 18, wherein each of said monocular subsystem (2, 2′, 2″) comprises a single image transducer (11).
 20. The system according to any one of the claims 17 to 19, wherein at least one of said one or more image acquisition subsystems is a binocular-type subsystem (3, 3′).
 21. The system according to claim 20, wherein each binocular-type subsystem (3, 3′) comprises two image transducers (11).
 22. The system according to any one of the claims 19 to 21, wherein each of said image transducers is a camera (11).
 23. The system according to claim 22, wherein each camera is apt to acquire at least 262 images/sec, with a resolution of at least 512×512 pixels.
 24. The system according to any one of the claims 17 to 23, further comprising means (12,13;12′,13′) for storing said acquired images.
 25. The system according to any one of the claims 17 to 24, wherein each image acquisition subsystem (2, 2′, 2″, 3, 3′) further comprises computing means (14;14′) apt to compute said relative position of said object with respect to said point of reference.
 26. The system according to any one of the claims 17 to 25, apt to implement a method according to any one of the claims 1 to
 16. 27. A field of play (100) comprising a system for the measurement of the relative position of an object with respect to a point of reference according to any one of the claims 17 to
 26. 